Cold Spring Harbor Laboratory Institutional Repository

eScholarship - University of California

D-Scholarship@Pitt

FigShare

Comparative analysis of RNA sequencing methods for degraded or low-input samples

Author: A Roberts
Aaron M Berlin
AL Beyer
Alec Wysoker
Andreas Gnirke
Andrey Sivachenko
Aviv Regev
B Langmead
B Li
BE Maden
C Trapnell
D Aird
D Ramsköld
David S DeLuca
Dawn Anne Thompson
Diego Borges-Rivera
DS DeLuca
F Tang
G Giannoukos
H Aviv
H Li
H Yi
JD Morlan
Joshua Z Levin
JZ Levin
L Yang
M Griffin
MA Tariq
Michele A Busby
Nathalie Pochet
R Huang
R Rosenkranz
Rahul Satija
S Islam
Timothy Fennell
TR Dreszer
X Pan
Xian Adiconis
YH Yang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/02/2013
Field of study

Available in PMC 2014 January 01.

RNA-seq is an effective method for studying the transcriptome, but it can be difficult to apply to scarce or degraded RNA from fixed clinical samples, rare cell populations or cadavers. Recent studies have proposed several methods for RNA-seq of low-quality and/or low-quantity samples, but the relative merits of these methods have not been systematically analyzed. Here we compare five such methods using metrics relevant to transcriptome annotation, transcript discovery and gene expression. Using a single human RNA sample, we constructed and sequenced ten libraries with these methods and compared them against two control libraries. We found that the RNase H method performed best for chemically fragmented, low-quality RNA, and we confirmed this through analysis of actual degraded samples. RNase H can even effectively replace oligo(dT)-based methods for standard RNA-seq. SMART and NuGEN had distinct strengths for measuring low-quantity RNA. Our analysis allows biologists to select the most suitable methods and provides a benchmark for future method development.

National Institutes of Health (U.S.) (Pioneer Award DP1-OD003958-01); National Human Genome Research Institute (U.S.) (NHGRI 1P01HG005062-01); National Human Genome Research Institute (U.S.) (NHGRI Center of Excellence in Genome Science Award 1P50HG006193-01); Howard Hughes Medical Institute (Investigator); Merkin Family Foundation for Stem Cell Research; Broad Institute of MIT and Harvard (Klarman Cell Observatory); National Human Genome Research Institute (U.S.) (NHGRI grant HG03067); Fonds voor Wetenschappelijk Onderzoek--Vlaandere

DSpace@MIT

Examples of sequence conservation analyses capture a subset of mouse long non-coding RNAs sharing homology with fish conserved genomic elements

Author: A Pauli
AJ Vilella
AN Khachane
AR Quinlan
B Bánfai
C Camacho
C Carrieri
C Trapnell
CJ Brown
D Licastro
DA Hosack
DR Kelley
DW Huang
DW Huang
Ferenc Müller
G Bejerano
GA Calin
H Jia
I Ulitsky
IA Qureshi
J Ponjavic
J Sheik Mohamed
J-W Nam
JL Rinn
JM Silva
JN Hutchinson
JP McCutcheon
KC Pang
KS Pollard
KS Pollard
L Duret
L Hui
L Kong
LA Pennacchio
M Aoyama
M Guttman
M Lin
ME Dinger
ME Dinger
MN Cabili
NR Zearfoss
NT Ingolia
P Carninci
P Flicek
P Flicek
P Flicek
PP Amaral
PP Amaral
R Arrial
RA Chodroff
Remo Sanges
S Haider
S Katayama
S Washietl
SE Seemann
SJ Hubbard
SR Eddy
Swaraj Basu
T Fawcett
T Gesell
T Kino
T Ota
T Sing
T-K Kim
TR Dreszer
TR Mercer
TR Mercer
UA Ørom
Y Okazaki
Y Sakuraba
Y Zhou
Z Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Background: Long non-coding RNAs (lncRNAs) are a major class of non-coding RNAs. They are involved in diverse intracellular mechanisms such as molecular scaffolding, splicing and DNA methylation. Through these mechanisms they are reported to play a role in cellular differentiation and development. They show enriched expression in the brain, where they are implicated in maintaining cellular identity, homeostasis, stress responses and plasticity. Low sequence conservation and lack of functional annotations make it difficult to identify homologs of mammalian lncRNAs in other vertebrates. A computational evaluation of the lncRNAs through systematic conservation analyses of both their sequences and their genomic architecture is required.

Results: Our results show that a subset of mouse candidate lncRNAs could be distinguished from random sequences based on their alignment with zebrafish phastCons elements. Using ROC analyses we were able to define a measure to select significantly conserved lncRNAs. Indeed, starting from ~2,800 mouse lncRNAs we could predict that between 4 and 11% present conserved sequence fragments in fish genomes. Gene ontology (GO) enrichment analyses of protein-coding genes proximal to the region of conservation in both organisms highlighted similar GO classes, such as regulation of transcription and central nervous system development. The proximal coding genes in both species show enrichment of their expression in brain. In summary, we show that interesting genomic regions in zebrafish could be marked based on their sequence homology to a mouse lncRNA, overlap with ESTs and proximity to genes involved in nervous system development.

Conclusions: Conservation at the sequence level can identify a subset of putative lncRNA orthologs. The similar protein-coding neighborhood and transcriptional information about the conserved candidates provide support to the hypothesis that they share functional homology.
The pipeline presented here represents a proof of principle showing that between 4 and 11% of lncRNAs retain regions of conservation between mammals and fishes. We believe this study will prove useful as a reference for analyzing the conservation of lncRNAs in newly sequenced genomes and transcriptomes. © 2013 Basu et al.; licensee BioMed Central Ltd

Springer - Publisher Connector

University of Birmingham Research Portal

Sissa Digital Library

The Tetraodon nigroviridis reference transcriptome: Developmental transition, length retention and microsynteny of long non-coding RNAs in a compact vertebrate genome

Author: A Kapusta
A Necsulea
A Pauli
A Stabenau
AJ Vilella
AR Quinlan
B Maher
C Nepal
C Trapnell
C Weaver
CA Watson
CM Smith
D Kim
DR Kelley
F Pelegri
G St. Laurent
GT Williams
H Aanes
H Hezroni
H Roest Crollius
H Roest Crollius
H Tilgner
I Ulitsky
J Harrow
J Kim
J Ponjavic
J Ruiz-Orera
J-W Nam
JB Brown
M Blanchette
M Chorev
M Lohse
MD Robinson
MN Cabili
NT Ingolia
O Jaillon
P Flicek
P Heyn
P Miura
R Arrial
RC Gentleman
S Aparicio
S Basu
S Brenner
S Durinck
S Mathavan
SA Harvey
SS Paranjpe
T Derrien
T Kino
TR Dreszer
V Haberle
W Tadros
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Pufferfish such as fugu and tetraodon carry the smallest genomes among all vertebrates and are ideal for studying genome evolution. However, comparative genomics using these species is hindered by the poor annotation of their genomes. We performed RNA sequencing during key stages of the maternal to zygotic transition of Tetraodon nigroviridis and report its first developmental transcriptome. We assembled 61,033 transcripts (23,837 loci) representing 80% of the annotated gene models and 3816 novel coding transcripts from 2667 loci. We demonstrate the similarities of gene expression profiles between pufferfish and zebrafish during the maternal to zygotic transition and annotated 1120 long non-coding RNAs (lncRNAs), many of which are differentially expressed during development. The promoters for 60% of the assembled transcripts were validated by CAGE-seq. Despite the extreme compaction of the tetraodon genome and the dramatic loss of transposons, the length of lncRNA exons remains comparable to that of other vertebrates, and a small set of lncRNAs appears enriched for transposable elements, suggesting a selective pressure acting on lncRNA length and composition. Finally, a set of lncRNAs are microsyntenic between teleost and vertebrates, which indicates potential regulatory interactions between lncRNAs and their flanking coding genes. Our work provides a fundamental molecular resource for vertebrate comparative genomics and embryogenesis studies.

University of Birmingham Research Portal

KITopen

Sissa Digital Library

Late Replicating Domains Are Highly Recombining in Females but Have Low Male Recombination Rates: Implications for Isochore Evolution

Author: A Cox
A Necsulea
BL Dumont
C Schmegner
C Schmegner
C-L Chen
Catherine J. Pink
CJ Pink
CJ Pink
CM Malcom
CM Ramsdell
D Karolchik
DJ Gaffney
E Yaffe
G Marais
G McVicker
G Piganeau
GE Magni
GE Magni
GI Lang
H Ellegren
I Hellmann
I Hiratani
J Berglund
J Meunier
J Perry
J-V Chamary
JA Stamatoyannopoulos
JF Crow
JF Crow
JN Strathern
JT Eppig
K Tamura
K Woodfine
KD Makova
KH Wolfe
L Duret
L Duret
Laurence D. Hurst
M Brudno
M Costantini
M Touchon
M-C Marsolier-Kergoat
MI Jensen-Seaman
MJ Lercher
MJ Lercher
MT Webster
N Galtier
N Weddington
Pawel Michalak
PD Keightley
S Farkash-Amar
S Ptak
S Shifman
S Tyekucheva
TC Brown
TR Dreszer
WH Li
Y Clément
Y Watanabe
Publication venue: Public Library of Science
Publication date: 20/09/2011
Field of study

In mammals, sequences that are either late replicating or highly recombining have high rates of evolution at putatively neutral sites. As early replicating domains and highly recombining domains both tend to be GC rich, we a priori expect these two variables to covary. If so, the relative contribution of either of these variables to the local neutral substitution rate might have been wrongly estimated owing to covariance with the other. Against our expectations, we find that sex-averaged recombination rates show little or no correlation with replication timing, suggesting that they are independent determinants of substitution rates. However, this result masks significant sex-specific complexity: late replicating domains tend to have high recombination rates in females but low recombination rates in males. That these trends are antagonistic explains why sex-averaged recombination is not correlated with replication timing. This unexpected result has several important implications. First, although both male and female recombination rates covary significantly with intronic substitution rates, the magnitude of this correlation is moderately underestimated for male recombination and slightly overestimated for female recombination, owing to covariance with replication timing. Second, the result could explain why male recombination is strongly correlated with GC content but female recombination is not. If, to explain the correlation between GC content and replication timing, we suppose that late replication forces reduced GC content, then GC promotion by biased gene conversion during female recombination is partly countered by the antagonistic effect of later replicating sequence tending to increase AT content. Indeed, the strength of the correlation between female recombination rate and local GC content is more than doubled by control for replication timing.
Our results underpin the need to consider sex-specific recombination rates and potential covariates in analysis of GC content and rates of evolution.

Deep RNA Sequencing Reveals Novel Cardiac Transcriptomic Signatures for Physiological and Pathological Hypertrophy

Author: A Mortazavi
AA Reyes
AC Eklund
AN Ladd
BJ van den Bosch
C Bolte
C Ikebe
C Yan
Christian Schönbach
CL Galindo
CL Himeda
CS Hong
D Catalucci
DM Henderson
Do Han Kim
DW Chan
E DesJardins
E Tanaka
F Damilano
F Soncin
GA Rezniczek
H Cha
H Kogo
Hong Ki Song
I Komuro
J Han
J Yang
JC Marioni
JH Lee
JJ Hunter
JM Johnson
JR McMullen
JY Park
K Asanuma
KJ Davies
KR Chien
M Fryknäs
M Iemitsu
M Kanehisa
M U
M Zhang
M Zhao
MJ McGrath
MJ Okoniewski
P Ahuja
Q Pan
R Arndt-Marić
R Berger
RZ Vencio
S Boulkroun
S Brown
S Gupta
S Lee
S Ramakrishna
S Ueno
SA Bossone
SE Brown
Seong-Eui Hong
SJ Matkovich
SN Wontakal
T Casneuf
T Yamazaki
Taeyong Kim
TL Bailey
TR Dreszer
V Beisvag
V Trichet
Y Hayashi
Z Fu
Publication venue: Public Library of Science
Publication date: 16/04/2012
Field of study

Although both physiological hypertrophy (PHH) and pathological hypertrophy (PAH) of the heart have similar morphological appearances, only PAH leads to fatal heart failure. In the present study, we used RNA sequencing (RNA-Seq) to determine the transcriptomic signatures for both PHH and PAH. Approximately 13–20 million reads were obtained for both models, among which PAH showed more differentially expressed genes (DEGs) (2,041) than PHH (245). The expression of 417 genes was barely detectable in the normal heart but was suddenly activated in PAH. Among them, Foxm1 and Plk1 are of particular interest, since Ingenuity Pathway Analysis (IPA) using DEGs and upstream motif analysis showed that they are essential hub proteins that regulate the expression of downstream proteins associated with PAH. Meanwhile, 52 genes related to collagen, chemokines, and actin showed opposite expression patterns between PHH and PAH. MAZ-binding motifs were enriched in the upstream region of the participating genes. Alternative splicing (AS) of exon variants was also examined using RNA-Seq data for PAH and PHH. We found 317 and 196 exon inclusions and exon exclusions, respectively, for PAH, and 242 and 172 exon inclusions and exclusions, respectively, for PHH. The AS pattern was mostly related to gains or losses of domains, changes in activity, and localization of the encoded proteins. The splicing variants of 8 genes (i.e., Fhl1, Rcan1, Ndrg2, Synpo, Ttll1, Cxxc5, Egfl7, and Tmpo) were experimentally confirmed. Multilateral pathway analysis showed that the patterns of quantitative (DEG) and qualitative (AS) changes differ depending on the type of pathway in PAH and PHH. One of the most significant changes in PHH is the severe downregulation of autoimmune pathways accompanied by significant AS. These findings revealed the unique transcriptomic signatures of PAH and PHH and also provided a more comprehensive understanding at both the quantitative and qualitative levels.

The Impact of Recombination on Nucleotide Substitutions in the Human Genome

Author: A Eyre-walker
A Eyre-walker
A Eyre-walker
A Eyre-walker
A Khelifi
A Kong
AE Vinogradov
AE Vinogradov
AJ Jeffreys
AP Bird
B Boussau
B Charlesworth
BC Lamb
C Coulondre
CA Bill
CC Spencer
CC Spencer
CH Langley
DA Filatov
DA Filatov
DB Kaback
DG Hwang
DJ Begun
E Belle
E Birney
E Birney
ES Lander
F Pardo-manuel De Villena
FC Chen
G Bernardi
G Bernardi
G Coop
G D'onofrio
G Marais
G Marais
G Marais
H Ellegren
I Gordo
I Hellmann
J Felsenstein
J Filipski
J Filipski
J Meunier
JA Wilder
JJ Bussell
JL Gerton
JN Strathern
JT Chang
K Holloway
KH Wolfe
KJ Fryxell
L Duret
L Duret
Laurent Duret
M Lipatov
MJ Lercher
MJ Lercher
MJ Lercher
MK Rudd
Molly Przeworski
MP Francino
MT Webster
MT Webster
MT Webster
N Galtier
N Galtier
N Galtier
N Patterson
NH Barton
Peter F. Arndt
PF Arndt
PF Arndt
PF Arndt
PF Arndt
PR Haddrill
R Sachidanandam
RD Hernandez
RH Waterston
RM Kliman
S Hughes
S Kuraku
S Myers
S Myers
SE Ptak
T Nagylaki
TC Brown
TD Petes
TM Collins
TR Dreszer
W Winckler
WH Press
Y Blat
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

Unraveling the evolutionary forces responsible for variations of neutral substitution patterns among taxa or along genomes is a major issue for detecting selection within sequences. Mammalian genomes show large-scale regional variations of GC-content (the isochores), but the substitution processes at the origin of this structure are poorly understood. We analyzed the pattern of neutral substitutions in 1 Gb of primate non-coding regions. We show that the GC-content toward which sequences are evolving is strongly negatively correlated to the distance to telomeres and positively correlated to the rate of crossovers (R² = 47%). This demonstrates that recombination has a major impact on substitution patterns in human, driving the evolution of GC-content. The evolution of GC-content correlates much more strongly with male than with female crossover rate, which rules out selectionist models for the evolution of isochores. This effect of recombination is most probably a consequence of the neutral process of biased gene conversion (BGC) occurring within recombination hotspots. We show that the predictions of this model fit very well with the observed substitution patterns in the human genome. This model notably explains the positive correlation between substitution rate and recombination rate. Theoretical calculations indicate that variations in population size or density in recombination hotspots can have a very strong impact on the evolution of base composition. Furthermore, recombination hotspots can create strong substitution hotspots. This molecular drive affects both coding and non-coding regions. We therefore conclude that along with mutation, selection and drift, BGC is one of the major factors driving genome evolution. Our results also shed light on variations in the rate of crossover relative to non-crossover events, along chromosomes and according to sex, and also on the conservation of hotspot density between human and chimp.

INRIA a CCSD electronic archive server

Brunel University Research Archive

HAL Descartes

MPG.PuRe

Author Correction: Expanded encyclopaedias of DNA elements in the human and mouse genomes

Author: Abascal F
Acosta R
Addleman NJ
Adrian J
Afzal V
Aken B
Akiyama JA
Amrhein H
Anderson SM
Andrews GR
Antoshechkin I
Ardlie KG
Armstrong J
Astley M
Banerjee B
Barkal AA
Barnes IHA
Barozzi I
Barrell D
Barson G
Bates D
Baymuradov UK
Bazile C
Beer MA
Beik S
Bender MA
Bennett R
Bernstein BE
Berry A
Bhaskar A
Bignell A
Blue SM
Bodine DM
Boix C
Boley N
Borrman T
Borsari B
Bouvrette LPB
Boyle AP
Brandsmeier LA
Breschi A
Bresnick EH
Brooks JA
Buckley M
Burge CB
Byron R
Cahill E
Cai L
Cao L
Carty M
Castanon RG
Castillo A
Chaib H
Chan ET
Chee DR
Chee S
Chen H
Chen H
Chen JY
Chen S
Cherry JM
Chhetri SB
Choudhary JS
Chrast J
Chung D
Clarke D
Cody NAL
Coppola CJ
Coursen J
Dalton S
Danyko C
Davidson C
Davila-Velderrain J
Davis CA
Dekker J
Deran A
DeSalvo G
Despacio-Reyes G
Dewey CN
Dickel DE
Diegel M
Diekhans M
Dileep V
Ding B
Djebali S
Dobin A
Dominguez D
Donaldson S
Drenkow J
Dreszer TR
Drier Y
Duff MO
Dunn D
D’Ippolito AM
Eastman C
Ecker JR
Edwards MD
El-Ali N
Elhajjajy SI
Jammal OA
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 12/05/2022
Field of study

Online Correction for: https://doi.org/10.1038/s41586-020-2493-4 | Erratum for https://bura.brunel.ac.uk/handle/2438/21299

In the version of this article initially published, two members of the ENCODE Project Consortium were missing from the author list. Rizi Ai (Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, CA, USA) and Shantao Li (Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA) are now included in the author list. These errors have been corrected in the online version of the article: 'Expanded encyclopaedias of DNA elements in the human and mouse genomes'.

https://www.nature.com/articles/s41586-021-04226-3

Expression and regulation of long noncoding RNAs in TLR4 signaling in mouse macrophages

Author: A Pauli
AC Marques
Ai-Ping Mao
AR Quinlan
B Langmead
C Plessy
F De Santa
G Hu
G Natoli
GD Barish
H Jia
H Li
J Feng
JL Rinn
JR Alvarez-Dominguez
JTY Kung
Jun Shen
KA Fitzgerald
KC Wang
KD Pruitt
LK Ellertsen
LX Garmire
M Guttman
M Shumway
MF Melgar
MN Cabili
N Raghavachari
NE Ilott
P Carninci
P Flicek
P Gellert
Q Liao
Q Liao
R Kolde
R Yamashita
RK Dave
S Carpenter
T Barrett
T Derrien
T Kawai
T Shiraki
T-K Kim
TR Dreszer
TR Mercer
UA Ørom
W Xu
WJ Kent
WN Venables
X Zhang
Y Zhang
Z Du
Z Li
Zhixiang Zuo
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Cloning, annotation and developmental expression of the chicken intestinal MUC2 gene

Author: A Heger
Amy C. Lossie
CK Heazlewood
CM Byrne
CT Collier
D Ambort
D Ambort
DA Benson
DM Karcher
F Escande
G Xu
GC Hansson
JA Chambers
JD Bendtsen
JL Desseyn
JM Larsson
JR Gum
JR Gum Jr
K Godl
K Watanabe
KD Pruitt
LR Sternberg
M Kozak
M Van der Sluis
MA Hollingsworth
MA McGuckin
ME Johansson
ME Johansson
ME Johansson
ME Lidell
MP Buisine
MY Galperin
N Asker
N Burger-van Paassen
N Burger-van Paassen
NG Karlsson
P Lu
P Wang
PE Boardman
PR Hoorens
PY Tam
S Hunter
Sebastian D. Fugmann
SZ Hasnain
T Lang
T Lang
Todd J. Applegate
TR Dreszer
WJ Kent
Y Matsuoka
YH Jeong
Z Jiang
Z Uni
Zhengyu Jiang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013
Field of study

Intestinal mucin 2 (MUC2) encodes a heavily glycosylated, gel-forming mucin, which creates an important protective mucosal layer along the gastrointestinal tract in humans and other species. This first line of defense guards against attacks from microorganisms and is integral to the innate immune system. As a first step towards characterizing the innate immune response of MUC2 in different species, we report the cloning of a full-length, 11,359 bp chicken MUC2 cDNA, and describe the genomic organization and functional annotation of this complex, 74.5 kb locus. MUC2 contains 64 exons and demonstrates distinct spatiotemporal expression profiles throughout development in the gastrointestinal tract; expression increases with gestational age and from anterior to posterior along the gut. The chicken protein has a domain organization similar to that of the human orthologue, with a signal peptide and several von Willebrand domains in the N-terminus and the characteristic cystine knot at the C-terminus. The PTS domain of the chicken MUC2 protein spans ~1600 amino acids and is interspersed with four CysD motifs. However, the PTS domain in the chicken diverges significantly from the human orthologue; although the chicken domain is shorter, the repetitive unit is 69 amino acids in length, which is three times longer than the human. The amino acid composition shows very little similarity to the human motif, which potentially contributes to differences in the innate immune response between species, as glycosylation across this rapidly evolving domain provides much of the mucosal barrier. Future studies of the function of MUC2 in the innate immune response system in chicken could provide an important model organism to increase our understanding of the biological significance of MUC2 in host defense and highlight the potential of the chicken for creating new immune-based therapies.